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APPARATUS AND METHOD FOR DATA PROCESSING USING MULTIPLY- 
ACCUMULATE INSTRUCTIONS 



S CROSS-REFERENCE TO RELATED APPLICATION 

This application claims the priority benefit of provisional application serial no. 
60/212,954, filed June 21, 2000 ? the full disclosure of which is incorporated herein by 
reference- 



B ACKGROUND OF THE INVENTION 

Field of Invention 

The present invention relates to an apparatus and a method of data processus 
system. More particularly, the present invention relates to an apparatus and a method 
of data processing system that uses multiply-accumulate instructions. The cat a 
processing system can easily detect an overflow case and simplify calculations to save 
execution time. 

Description of Related Art 

In the field of data processing, it is necessary to be able to perform certain 
20 operations upon operands stored in various data registers. One such operation is to 
multiply an N-bit operand by a second N-bit operand and add a third N-bit operand to 
the result. Another similar operation is to multiply an N-bit operand by a second K-bi t 
operand and add a 2N-bit operand to the result. 

Figure 1 is a block diagram showing a conventional multiply-accumulator iuit 
25 for a data processing system in US patent No. 5,583,804, titled "DATA PROCESSING 

l 
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USING MULTIPLY-ACCUMULATE INSTRUCTIONS". This system is capable of 
performing a first class of multiply-accumulate instructions in the form of 
N*N+2N-»2N } ^ a second class of multiply-accumulate instructions in the form of 
N*N+N->N. 

5 The multiply-accuraulator unit comprises a first data register 10, a second data 

register 20, an N*N multiplier 30, an N+N accumulator 40, and a 2N+2N accumulator 
50, The multiplier 30 is capable of calculating N*N to get an N or a 2N result. The 
N+N accumulator 40 is capable of calculating N+N to get an N result. The 2N+2N 
accumulator 50 is capable of calculating 2N+2N to get a 2N result. 

10 However, it is possible while performing an operation on an N*N-N->N class 

instruction, that the final N result is greater than can be represented in a result of N-bit 
size. It is important when this situation happens, that the user be made aware that an 
overflow condition has occurred in the operation. A disadvantage of the multiply, 
accumulator unit of Figure 1 is the unit's inability to show this overflow condition. To 

IS provide this critical information to the user in as efficient manner as possible is one 
reason the multiply-accumulator unit of the present invention was developed. 

SUMMARY OF THE INVENTION 
Accordingly, an object of the present invention is to provide a data processing 
20 system comprising a single accumulator and capable of detecting overflow conditions 
and capable of performing multiply-accumulate instructions. Hence, syr.em 
architecture is simplified and at the same time, valuable overflow information is 
provided. The data processing system can easily detect an overflow case arid simplify 
calculations to save execution time. 
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To achieve these and other advantages and in accordance with the purpose of tho 
invention, as embodied and broadly described herein, the invention provides as 
apparatus for processing data, the apparatus comprising a first register bank of N-bit 
data processing registers, a second register bank of N-bit data processing registers, a 
5 selector, a multiplier and an accumulator. The selector is coupled to the first register 
bank and the second register bank and is used for selecting one of the first and second 
register banks and outputting N-bit data from the selected register banks. The 
outputted N-bit data and the N-bit data held in the second register bank form a 2N-bit 
addition operand. The multiplier is used for performing multiply operation upon a first 
10 operand and a second operand and outputting an 2N-bit multiplied result. The 
accumulator is coupled to the multiplier, the selector and the second register bank and is 
used for performing accumulate operation upon the 2N-bit multiplied result asd the 2N- 
bit addition operand and outputting a 2N-bit accumulated result. 

In the apparatus for processing data, the selector is further used for receiving a 
15 class signal, wherein the selector selects one of the first and second register banks in 
response to the class signal 

The class signal of the above-mentioned apparatus for processing data is used 
for indicating a first class of instruction or a second class of instruction. 

The apparatus for processing data further comprises a detecting device, coupled 
20 to the accumulator, for receiving the 2N~bit accumulated result and for checking if a 
case of overflow occurs. 

In the apparatus for processing data, the outputted N-bit data from the selector 
and the N-bit data held in the second register bank are formed in combination as a first 
N-bit part and a second N-bit part of the 2N-bit addition operand^ the accumulated result 
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includes a third N-bit part and a forth N-bit part, when the class signal i$ the second 
class of instruction, the detecting device comparing the first N-bit part of the 2N-bit 
addition operand and the third N-bit part of the accumulated result to determine if the 
case of overflow occurs. 
5 To achieve these and other advantages and in accordance with the purpose of the 

invention, a$ embodied and broadly described herein, the invention provides a method 
for processing data using an apparatus having a first register bank of N-bit data 
processing registers, a second register bank of N-bit data processing registers, a selector, 
a multiplier and an accumulator, the method comprising selecting one of the fiist and 
: ^ 10 second register banks and outputting N-bit data from the selected register banks, 
t\ wherein the outputted N-bit data and the N-bit data held in the second register bank 

:"S form a 2N-bit addition operand; performing multiply operation upon a first operand and 

K a second operand and outputting an 2N-bit multiplied result; performing accumulate 

.7 operation upon the 2N-bit multiplied result and the 2N-bit addition operand and 

U 1 5 outputting a 2N-bit accumulated result. 

; ;q In the above-mentioned method for processing data, the step of selecting one of 

q the first and second register banks and outputting N-bit data from the selected register 

banks further comprising a step of receiving a class signal is determined by a class 

signal received by the selector. 
20 The class signal is used for indicating a first class of instruction or a second class 

Of instruction. 

In the above-mentioned method for processing data further comprises a step of 
receiving the 2N-bit accumulated result and checking if a case of overflow occurs. 
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In the above-mentioned method for* processing data, the outputted N-bit data 
from the selector and the N-bit data held in the second register bank are formed in 
combination as a first N-bit part and a second N-bit part of the 2N-bit addition operand, 
the accumulated result includes a third N-bit part and a forth N-bit part, when the class 

5 signal is the second class of instruction, comparing the first N-bit part of the 2N-bit 
addition operand and the third N-bit part of the accumulated result to determine if the 
case of overflow occurs. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary, and are intended to provide fiirthei 

10 explanation of the invention as claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The accompanying drawings are included to provide a further "understanding of 
the invention, and are incorporated in and constitute a part of this specification. The 
15 drawings illustrate embodiments of the invention and, together with the description, 
serve to explain the principles of the invention, In the drawings, 

Figure 1 is a block diagram showing a conventional multiply-accumulator "unit 
for a data processing system; and 

Figure 2 is a block diagram showing a multiply-accumulator unit for a data 
20 processing system according to the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Reference will now be made in detail to the preferred embodiments of the present 
invention, examples of which are illustrated in the accompanying drawings. Wherevei 
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possible, the same reference numbers are used in the drawings and the description to 
refer to the same or like parts. 

Refer to Figure 2 ? which is a block diagram showing a multiply-accumulator 
unit 200 for a data processing system according to the present invention. The 
5 midtiply-accumulator unit 200 includes a selector 160, a multiplier 130, and an 
accumulator 150. Data are supplied from a special register bank 110 and a general 
register bank 120. The multiplier 1 30 is coupled to the accumulator 150. The special 
register bank 110 and the general register bank 120 are coupled to the selector 160. 
The selector 160 is coupled to the accumulator 150. The general register bank 12C is 
10 also directly coupled to the accumulator 150, The multiply-accumulator unit 200 
further includes a detecting device 170 coupled to the accumulator 150. 

The multiplier 130 is capable of multiplying two N-bit operands and then gets a 
2N-bit multiplied result 134. For example, N*N to get a 2N result. As shown in 
Fig.2, a first N-bit operand denoted as E and a second N-bit operand denoted as F are 
15 inputted into the multiplier 130. A 2N-bit multiplied result 134 is generated by the 
multiplier 130. The multiplied result 134 from multiplier 130 is sent to the 2N-bit 
accumulator 150 and is added by an addition operand- The addition operand is also 
2N bits including a first N-bit part and a second N-bit pan. In the invention, just one 
accumulator is necessary to provide more desired calculations. For example, in the 
20 prior art, if two calculations such as N*N+N->N and N*N+2N->2N are desired, there 
are at least two accumulators are necessary to provide these desired calculations. In 
the architecture of the invention, only one accumulator is necessary for the two 
calculations. The circuits as shown in Fig. 2 can implement the feature. 
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In the invention, a class signal 1G0 is provided to the selector 160 to indicate one 
of two different classes of instructions. The class signal 100 indicates which class of 
instructions is being operated on. For example, a first class of instructions such as 
N*N+2N^>2N or a second class of instructions such as N*N-HtON. The class signal 
5 is set by a decoding instruction supplied to the multiply-accumulator unit 200, The 
first class of instruction needs more execution time and a more precise result is 
generated therefrom. The second class of instruction needs less execution time tten 
the first class of instruction. 

When the class of instructions is the first class, i.e., if the desired calculation ; s 
10 N*N+2N-^2N, the class signal 100 causes the selector 160 to provide data from the 
general register bank 120 to the accumulator 150. That is, the 2N bits of the addition 
signal 152 includes (N, N). The first N-bit part of the addition signal 152 is denoted 
by C, and the second N-bit part of the addition signal 152 is denoted by D. The first 
N-bit part C is provided by the general register bank 120. The second N-bit part D is 
1 5 also provided from the general register bank 1 20, as shown in Fig.2. 

When the class of instructions is the second class, i.e., N*N+N->N, the class 
signal 100 causes the selector 160 to provide data from the special register bank 110 to 
the accumulator 150, That is, the special register 110 provides the first N-bit part C of 
the addition operandl52. The special register bank 1 10 can be accessed by users u ade*: 
20 software control . The second N-bit part D is provided from the general register bank 
120. In the multiply-accumulator unit 200 of the preferred embodiment of the 
invention, a 2N-bit (152) will be added by the accumulator 150 and a 2W-bit 
accumulated result will be generated, no matter the calculation of N*N+N-»N ;;ust 
needs a N-bit data to be added and a N-bit accumulated result to be generated. The 
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architecture of the invention provides some advantages. For example,, a case cf 
overflow will be easily detected in such design. The details will be described below. 
Another advantage is simplifying some calculations to save execution time. For 

example, if a calculation such as ^£XJ k *X 9 Y < > + X i Y x + + XJ„is desired, the 

5 comparison between the preferred embodiment and the prior art is as followed; 
In the prior art, the program language is: 
for(k-0;k>n;k++) { 
Move X k to 1^ 
Move Y k to R l 

10 R^VRi + &2 ; mla Ro, r;, R, 

k=k+l 

} 

wherein the "MLA" is an instruction for N*N+N=N, the result after executing 
the ,r MLA" instruction is 32-bit in length. 
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In the preferred embodiment, the progr$p language is: 
for(kK);k£n;k++) { ^ 

Move X k to t> 

Move Y k to Ri " ** 

(Rcp , R 2 )=R 0 *R 1 + (Rep , R 2 ); MLA R., R,, R„ R, 

k=k+l 

} 

wherein the "MLA" is an instruction for N*N+N=N. the result after execucirg 
the "MLA" instruction in the program of the invention is 64-bit m length. 
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In the preferred embodiment of the invention, a result of summation is 6 4-1: it 
and the prior art is 32-bit If a 64-bit result is required in the prior art, a calculation ;: f 
N*N+2N-^2N is necessary, which needs more execution time than that in the 
embodiment. That is ? in the invention a simplified calculation is necessary for we 
5 same as in the prior ait. 

After accumulation, the accumulator 150 generates an accumulated result 154. 
The accumulated result 154 is 2N-bit and includes a first N-bit part H and a second N- 
bit part I. The accumulated result 154 will be an output result of the multiply- 
accumulator unit 200. The accumulated result 1 54 also can be outputted to a detection 
10 device 170 to detect a case of overflow, which is indicated by a detective result 172, 

In such a case that the class of instructions is the second class, i.e., calculator of 
N*N+JON is desired. The detection device 170 will compare the first N-bit part H of 
the accumulated result 154 with the first N-bit part C of the addition operand 1:52, 
which is supplied by the special register bank 110. If the first N-bit part H of the 
15 accumulated result 154 is not the same as the fust N-bit part C of the addition operand 
152, it means that an overflow case is occurred in this calculation. 

For clarity, the two N-bit operands that inputted into the multiplier 130 are 
respectively denoted as E and F. The calculation of N*N+N-^N instruction can be 
implemented by the invention as E*F+CD-^HI. In this case that C is not equal tc the 
20 H, that means that an overflow occurs. The calculation of N*N+2N-^2N instruction 
can also be implemented by the same architecture of the invention as B*F-rC0->HL 

For an N*N+2N-^2N class instruction, the accumulator adds CD to die result of 
the E*F multiply operation, to get an accumulated HI result. Remembering th&- Ihe 
accumulator only performs one type of calculation, 2N+2N->2N. For an N*N+K ~>X 
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class instruction the accumulator adds CD to the result of the £*F multiply operation, to 
get an HI result, C ? in this case, is provided by the special register bank 1 10. If after 
the addition operation H does not equal C, then there is an overflow. 

This overflow indicator provides u$ers with useful information in a quick and 

5 convenient manner whereas the multiply-accumulator unit of Figure 1 does not provide 
this overflow indication. This is another advantage of the present invention. 

It will be apparent to those skilled in the art that various modifications *iid 
variations can be made to the structure of the present invention without departing from 
the scope or spirit of the invention. In view of the foregoing, it is intended that the 

10 present invention cover modifications and variations of this invention provided they fall 
within the scope of the following claims and their equivalents. 
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WHAT IS CLAIMED IS: 

L An apparatus for processing data, said apparatus comprising; 

a special register bank of N-bit data processing registers; 

a general register bank of N-bit data processing registers; 
5 a selector, coupled to the special register bank and the general register bank, foi 

selecting one of the special and general register banks and outputiing a selected N-bit 
result from the selected register bank, wherein the selected N-bit result and a N-bit data 
form a 2N-bit addition operand; 

a multiplier for performing multiply operation upon a first operand and a second 
10 operand and outputting an 2N-bit multiplied result; 

an accumulator, coupled to the multiplier, the selector and the general register 
bank, for performing accumulate operation upon the 2N-bit multiplied lesult and \h$ 
2N-bit addition operand and outputting a 2N-bit accumulated result. 

2, The apparatus for processing data of claim 1 ? wherein the N-bit data is helc in 
1 5 the general register bank. 

3, The apparatus for processing data of claim 1, the selector further leceivm^ a 
class signal, wherein the selector selects one of the special and general register banks in 
response to the class signal. 

4, The apparatus for processing data of claim 3, the class signal is used for 
20 indicating a first class of instruction or a second class of instruction, wherein the first 

class of instruction is executing a first calculation of N*N+2N~>2N arid the second 
class of instruction is executing a second lalculation of N*N+N->N. 
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5. The apparatus for processing data of claim 4, the apparatus further comprise g 
a detecting device, coupled to the accumulator, for receiving the 2N-bit accumulated 
result and for checking if a case of overflow occurs. 

6. The apparatus for processing data of claim 5 7 wherein 

5 the outputted N-bit result from the selector and the N-bit data held in the general 

register bank are formed in combination as a first N-bit part and a second N-bit part of 
the 2N-bit addition operand, 

the accumulated result includes a third N-bit part and a forth N-bit part, 
when the class signal is the second class of instruction, the detecting device 
10 comparing the first N-bit part of the 2N-bit addition operand and the third N-bit part of 
the accumulated result to determine if the case of overflow occurs. 

7. The apparatus for processing data of claim L the apparatus further comprising 
a detecting device, coupled to the accumulator, for receiving the 2N-bit accumulated 
result and for checking if a case of overflow occurs. 

15 8. The apparatus for processing data of claim 7, wherein 

the outputted N-bit result from the selector and the N-bit data are formed in 
combination as a first N-bit part and a second N-bit part of the 2N~bit addition operand, 

the accumulated result includes a third N-bit part and a forth N-bit part, 

when the class signal is the second class of instruction, the detecting device 
20 comparing the first N-bit part of the 2N-bit addition operand and the third N-bit p -irt of 
the accumulated result to determine if the case of overflow occurs. 

9, A method for processing data using an apparatus having a special register 
bank of N-bit data processing registers, a general register bank of N-bit data processing 
registers, a selector, a multiplier and an accumulator, the method comprising: 
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selecting one of the special and general register banks and outputting a N-Kt 
result from the selected register bank, wherein the N-bit result and a N-bit data form a 
2N-bit addition operand; 

performing multiply operation upon a first operand and a second operand and 
5 outputting an 2N-bit multiplied result; 

performing accumulate operation upon the 2N-bit multiplied result and the 2N- 
bit addition operand and outputting a 2N-bit accumulated result. 

10. The method for processing data of claim 9 ? wherein the N-bit data is helc in 

the general register bank. 
10 11. The method for processing data of claim 10, wherein the step of selecting 

one of the special and general register banks and outputting N-bit data from the selected 
register banks further comprising a step of receiving a class signal is determined by a 
class signal received by the selector 

12, The method for processing data of claim 11, the class signal is used for 
15 indicating a first class of instruction or a second class of instruction, wherein the first 

class of instruction is executing a first calculation of N*N+2N->2N and the second 
class of instruction is executing a second calculation of N*N+N->N. 

13. The method for processing data of claim 12, further comprising a 6t E-p o*r 
receiving the 2N-bit accumulated result and checking if a case of overflow occurs,, 

20 14, The method for processing data of claim 1 3, wherein 

the oxitputted N-bit result from the selector and the N-bit data held in the genera] 
register bank are formed m combination as a first N-bit part and a second N-bit pan of 
the 2N-bit addition operand, 

the accumulated result includes a third N-bit part and a forth N-bit part, 
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when the class signal is the second class of instruction, comparing the first N-tut 
pan of the 2N-bit addition operand and the third N-bit part of the accumulated result to 
determine if the case of overflow occurs. 

15. The method for processing data of claim 9 ? further comprising a step of 
5 receiving the 2N-bit accumulated result and checking if a case of overflow occurs. 

16. The method for processing data of claim 15, wherein 

the outputted N-bit result from the selector and the N-bit data are formed in 
combination as a first N-bit part and a second N-bit part of the 2N-bit addition operand, 

the accumulated result includes a third N-bit part and a forth N-bit part, 
10 when the class signal is the second class of instruction, comparing the fust N-bit 

part of the 2N-bit addition operand and the third N-bit part of the accumulated result to 
determine if the case of overflow occurs. 
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ABSTRACT OF THE DISCLOSURE 
An apparatus and a method of data processing system that uses multiply- 
accumulate instructions. The apparatus for processing data, the apparatus comprising 
a special register bank of N-bit data processing registers, a general register bank of N- 

5 bit data processing registers, a selector, a multiplier and an accumulator. The selector 
is coupled to the special register bank and the general register batik and is used ki 
selecting one of the special and general register banks and outputting N-bit data from 
the selected register banks. The outputted N-bit data and the N-bit data held in the 
general register bank form a 2N-bit addition operand The multiplier is used for 

10 performing multiply operation upon a first operand and a second operand and outputting 
an 2N-bit result The accumulator is coupled to the multiplier, the selector and the 
general register bank and is used for performing accumulate operation upon the 2N~bit 
result and the 2N-bit addition operand and outputting a 2N-bit accumulated result. 
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